The Best 395 Image Segmentation Tools in 2025
Clipseg Rd64 Refined
Apache-2.0
CLIPSeg is an image segmentation model conditioned on text and image prompts, supporting zero-shot and one-shot image segmentation tasks.
Image Segmentation
Transformers

CIDAS
10.0M
122
RMBG 1.4
Other
BRIA RMBG v1.4 is an advanced background removal model designed for efficiently separating foreground and background in various types of images, suitable for non-commercial use.
Image Segmentation
Transformers

briaai
874.12k
1,771
RMBG 2.0
Other
The latest background removal model developed by BRIA AI, capable of effectively separating foreground and background in various images, suitable for large-scale commercial content creation scenarios.
Image Segmentation
Transformers

briaai
703.33k
741
Segformer B2 Clothes
MIT
SegFormer model fine-tuned on the ATR dataset for clothing and human segmentation
Image Segmentation
Transformers

mattmdjaga
666.39k
410
Sam Vit Base
Apache-2.0
SAM is a vision model capable of generating high-quality object masks from input prompts (such as points or boxes), supporting zero-shot segmentation tasks
Image Segmentation
Transformers Other

facebook
635.09k
137
Birefnet
MIT
BiRefNet is a deep learning model for high-resolution binary image segmentation, which achieves accurate image segmentation through a bilateral reference network.
Image Segmentation
Transformers

ZhengPeng7
626.54k
365
Segformer B1 Finetuned Ade 512 512
Other
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20K dataset, suitable for image segmentation tasks at 512x512 resolution.
Image Segmentation
Transformers

nvidia
560.79k
6
Sam Vit Large
Apache-2.0
SAM is a vision model capable of generating high-quality object masks from input points or bounding boxes, with zero-shot transfer capability.
Image Segmentation
Transformers Other

facebook
455.43k
28
Face Parsing
Semantic segmentation model fine-tuned from nvidia/mit-b5 for face parsing tasks
Image Segmentation
Transformers English

jonathandinu
398.59k
157
Sam Vit Huge
Apache-2.0
SAM is a vision model capable of generating high-quality object masks based on input prompts, supporting zero-shot transfer to new tasks
Image Segmentation
Transformers Other

facebook
324.78k
163
Mask2former Swin Large Cityscapes Semantic
Other
A Mask2Former model with a Swin-Large backbone, trained for semantic segmentation on the Cityscapes dataset. Mask2Former uses a single unified architecture for instance, semantic, and panoptic segmentation.
Image Segmentation
Transformers

facebook
296.33k
24
Mask2former Swin Large Ade Semantic
Other
A Mask2Former model with a Swin-Large backbone, trained on the ADE20K semantic segmentation dataset, employing a unified paradigm for image segmentation tasks.
Image Segmentation
Transformers

facebook
238.92k
15
Sam2.1 Hiera Large
Apache-2.0
SAM 2 is a foundation model from FAIR for promptable visual segmentation in images and videos, supporting universal segmentation tasks through prompts.
Image Segmentation
facebook
203.27k
81
Segformer B0 Finetuned Ade 512 512
Other
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20k dataset, suitable for 512x512 resolution image segmentation tasks.
Image Segmentation
Transformers

nvidia
179.04k
156
Chest X Ray Basic
This model performs simultaneous segmentation and classification tasks on chest X-rays, including lung/heart segmentation, position recognition, and age/gender prediction.
Image Segmentation
Transformers

ianpan
175.20k
1
Oneformer Coco Swin Large
MIT
OneFormer is the first multi-task universal image segmentation framework, achieving semantic segmentation, instance segmentation, and panoptic segmentation tasks with a single model
Image Segmentation
Transformers

shi-labs
165.70k
3
Sam2 Hiera Large
Apache-2.0
A foundational model for promptable visual segmentation in images and videos developed by FAIR
Image Segmentation
facebook
155.85k
68
Mask2former Swin Tiny Coco Instance
Other
A Mask2Former instance segmentation model with a Swin-Tiny backbone, trained on the COCO dataset. Mask2Former handles instance, semantic, and panoptic segmentation with one unified architecture.
Image Segmentation
Transformers

facebook
149.85k
7
Oneformer Ade20k Swin Large
MIT
OneFormer is the first multi-task universal image segmentation framework that supports semantic segmentation, instance segmentation, and panoptic segmentation tasks with a single model.
Image Segmentation
Transformers

shi-labs
141.57k
24
Birefnet HR Matting
MIT
BiRefNet is a high-resolution binary image segmentation model based on bilateral reference, specifically designed for high-resolution transparent image matting.
Image Segmentation
ZhengPeng7
141.30k
2
Segformer B3 Clothes
MIT
SegFormer model fine-tuned on the ATR dataset, primarily used for clothing segmentation and also applicable to human body segmentation
Image Segmentation
Transformers

sayeed99
102.42k
23
Mit B0
Other
SegFormer is a Transformer-based semantic segmentation model featuring a hierarchical encoder and lightweight MLP decoder design, excelling in benchmarks like ADE20K and Cityscapes.
Image Segmentation
Transformers

nvidia
83.99k
35
Segformer B3 Fashion
Other
A fashion item image segmentation model based on SegFormer architecture, specifically designed for identifying and segmenting clothing and accessories
Image Segmentation
Transformers

sayeed99
75.65k
21
Oneformer Cityscapes Dinat Large
MIT
A multi-task universal image segmentation model trained on the Cityscapes dataset, supporting semantic segmentation, instance segmentation, and panoptic segmentation tasks
Image Segmentation
Transformers

shi-labs
70.19k
0
Mask2former Swin Tiny Cityscapes Semantic
Other
Mask2Former is a unified image segmentation framework capable of handling instance segmentation, semantic segmentation, and panoptic segmentation tasks. This model is based on the Swin-Tiny backbone network and has been fine-tuned for semantic segmentation on the Cityscapes dataset.
Image Segmentation
Transformers

facebook
55.98k
3
Anzhcs YOLOs
A series of object detection and segmentation models built on the YOLOv8 and YOLOv11 architectures, focused on processing artistic images
Image Segmentation Other
Anzhc
48.07k
44
Mask2former Swin Base Coco Panoptic
Other
A Mask2Former model with a Swin-Base backbone, trained on the COCO panoptic segmentation dataset. It adopts a unified paradigm to handle instance segmentation, semantic segmentation, and panoptic segmentation tasks.
Image Segmentation
Transformers

facebook
45.01k
14
Segformer B2 Finetuned Ade 512 512
Other
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20k dataset, suitable for image segmentation tasks at 512x512 resolution.
Image Segmentation
Transformers

nvidia
44.07k
3
Upernet Convnext Small
MIT
UperNet is a framework for semantic segmentation that uses ConvNeXt as its backbone network, enabling pixel-level semantic label prediction.
Image Segmentation
Transformers English

openmmlab
43.31k
31
Segformer B5 Finetuned Ade 640 640
Other
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20K dataset, suitable for image segmentation tasks at 640x640 resolution.
Image Segmentation
Transformers

nvidia
42.32k
39
Sam2 Hiera Tiny
Apache-2.0
SAM 2 is a foundation model from FAIR for promptable visual segmentation in images and videos, supporting efficient segmentation through prompts.
Image Segmentation
facebook
41.88k
20
Mask2former Swin Large Coco Panoptic
Other
A Mask2Former model with a Swin-Large backbone, trained for panoptic segmentation on the COCO dataset
Image Segmentation
Transformers

facebook
37.67k
30
Mask2former Swin Large Coco Instance
Other
Mask2Former is a Transformer-based unified image segmentation model, utilizing a Swin-Large backbone and fine-tuned on the COCO dataset, specializing in instance segmentation tasks.
Image Segmentation
Transformers

facebook
37.31k
6
Birefnet HR
MIT
BiRefNet is a bilateral reference framework model for high-resolution binary image segmentation, focusing on background removal and mask generation tasks.
Image Segmentation
ZhengPeng7
35.07k
62
Segformer B5 Finetuned Cityscapes 1024 1024
Other
A SegFormer semantic segmentation model fine-tuned on the Cityscapes dataset at 1024x1024 resolution, featuring a hierarchical Transformer encoder and a lightweight all-MLP decoder head.
Image Segmentation
Transformers

nvidia
31.18k
24
RADIO L
AM-RADIO is a visual foundation model developed by NVIDIA Research, featuring an aggregated architecture for unified multi-domain representation, suitable for various computer vision tasks.
Image Segmentation
Transformers

nvidia
23.27k
8
Upernet Convnext Large
MIT
UperNet is a semantic segmentation framework, here paired with a ConvNeXt-Large backbone for pixel-level semantic label prediction.
Image Segmentation
Transformers English

openmmlab
23.09k
0
Segformer B1 Finetuned Cityscapes 1024 1024
Other
This SegFormer model is fine-tuned on the Cityscapes dataset at 1024x1024 resolution, featuring a hierarchical Transformer encoder and a lightweight all-MLP decoder head.
Image Segmentation
Transformers

nvidia
20.27k
17
Slimsam Uniform 77
Apache-2.0
SlimSAM is an innovative SAM model compression method that efficiently reuses pre-trained SAM through a unified pruning-distillation framework, eliminating the need for extensive repeated training.
Image Segmentation
Transformers Other

Zigeng
18.82k
24
Sam2 Hiera Base Plus
Apache-2.0
SAM 2 is a foundation model from FAIR for promptable visual segmentation in images and videos, supporting efficient segmentation through prompts.
Image Segmentation
facebook
18.17k
6
Mask2former Swin Small Coco Instance
Other
Mask2Former is a Transformer-based unified image segmentation model, fine-tuned on the COCO dataset for instance segmentation tasks
Image Segmentation
Transformers

facebook
17.51k
7
Mit B5
Other
SegFormer is a Transformer-based semantic segmentation model. This version only includes the encoder pretrained on ImageNet-1k.
Image Segmentation
Transformers

nvidia
15.94k
9